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Abstract 

The leaves of the Coriandrum sativum plant, known 
as cilantro or coriander, are widely used in many 
cuisines around the world. However, far from being 
a benign culinary herb, cilantro can be polarizing — 
many people love it while others claim that it tastes 
or smells foul, often like soap or dirt. This soapy 
or pungent aroma is largely attributed to several 
aldehydes present in cilantro. Cilantro preference is 
suspected to have a genetic component, yet to date 
nothing is known about specific mechanisms. Here 
we present the results of a genome-wide association 
study among 14,604 participants of European ances- 
try who reported whether cilantro tasted soapy, with 
replication in a distinct set of 11,851 participants who 
declared whether they liked cilantro. We find a single 
nucleotide polymorphism (SNP) significantly associ- 
ated with soapy-taste detection that is confirmed in 
the cilantro preference group. This SNP, rs72921001, 
(p = 6.4- 10~ 9 , odds ratio 0.81 per A allele) lies within 
a cluster of olfactory receptor genes on chromosome 
11. Among these olfactory receptor genes is OR6A2, 
which has a high binding specificity for several of the 
aldehydes that give cilantro its characteristic odor. 
We also estimate the heritability of cilantro soapy- 
taste detection in our cohort, showing that the heri- 
tability tagged by common SNPs is low, about 0.087. 
These results confirm that there is a genetic com- 



ponent to cilantro taste perception and suggest that 
cilantro dislike may stem from genetic variants in ol- 
factory receptors. We propose that OR6A2 may be 
the olfactory receptor that contributes to the detec- 
tion of a soapy smell from cilantro in European pop- 
ulations. 

Background 

The Coriandrum sativum plant has been cultivated 
since at least the 2nd millennium BCE [1]. Its fruits 
(commonly called coriander seeds) and leaves (called 
cilantro or coriander) are important components of 
many cuisines. In particular, South Asian cuisines 
use both the leaves and the seeds prominently, and 
Latin American food often incorporates the leaves. 

The desirability of cilantro has been debated for 
centuries. Pliny claimed that coriander had impor- 
tant medicinal properties: "vis magna ad refrigeran- 
dos ardores viridi" ("While green, it is possessed of 
very cooling and refreshing properties" ) [2] . The Ro- 
mans used the leaves and seeds in many dishes, in- 
cluding moretum (a herb, cheese, and garlic spread 
similar to today's pesto) [3]; the Mandarin word for 
cilantro, fj^tl (xiangcai), literally means "fragrant 
greens" . However, the leaves in particular have long 
inspired passionate hatred as well; e.g., John Gerard 
called it a "very stinking herbe" with leaves of "ven- 



emous quality" [4, 5]. 

It is not known why cilantro is so differentially per- 
ceived. The proportion of people who dislike cilantro 
varies widely by ancestry [6]; however, it is not clear 
to what extent this may be explained by differences 
in environmental factors, such as frequency of expo- 
sure. Genetics has been thought to play a role, but 
to date no studies have found genetic variants influ- 
encing cilantro taste preference. 

The smell of cilantro is often described as pun- 
gent or soapy. It is suspected, although not proven, 
that cilantro dislike is largely driven by the odor 
rather than the taste. The key aroma components 
in cilantro consist of various aldehydes, in particular 
(E)-2-alkenals and n-aldehydes [7, 8]. The unsatu- 
rated aldehydes (mostly decanal and dodecanal) in 
cilantro are described as fruity, green, and pungent; 
the (E)-2-alkenals (mostly (E)-2-dccenal and (E)-2- 
dodecenal) as soapy, fatty, "like cilantro" , or pungent 
[7, 8]. 

Several families of genes are important for taste 
and smell. The TAS1R and TAS2R families form 
sweet, umami, and bitter taste receptors [9, 10]. The 
olfactory receptor family contains about 400 func- 
tional genes in the human genome. Each receptor 
binds to a set of chemicals, enabling one to recognize 
specific odorants or tastants. Genetic differences in 
many of these receptors are known to play a role in 
how we perceive tastes and smells [11, 12, 13, 14]. 

Results and discussion 

Here we report the first ever genome-wide associ- 
ation study (GWAS) of cilantro soapy-taste detec- 
tion. Briefly, the GWAS was conducted in 14,604 
unrelated participants of primarily European ances- 
try who responded to an online questionnaire asking 
whether they thought cilantro tasted like soap (Ta- 
ble 1). Two single nucleotide polymorphisms (SNPs) 
were significant genome- wide (p < 5 • 10~ 8 ) in this 
population. One SNP, in a cluster of olfactory recep- 
tors, replicated in a non-overlapping group of 11,851 
participants (again, unrelated and of primarily Euro- 
pean ancestry) who reported whether they liked or 
disliked cilantro (see Methods for full details). Fig- 



N Female Age (SD) 

Tastes soapy 1994 0.566 49.0 (15.0) 

Doesn't taste soapy 12610 0.489 48.3 (15.2) 

Total 14604 0.500 48.4 (15.2) 

Dislikes cilantro 31~81 0.487 47.1 (16.6) 

Likes cilantro 8906 0.420 43.8 (14.5) 

Total 12087 0.438 44.7 (15.1) 



Table 1: Summary of the cohorts used in the analysis 

urc 1 shows p- values across the whole genome; Fig- 
ure 3 shows p-values near the most significant asso- 
ciations. A quantile-quantile plot (Figure 2) shows 
little (A = 1.007) global inflation of p-valucs. Index 
SNPs with p-values under 10 -6 are shown in Table 2 
(along with replication p- values) . 

We found one significant association for cilantro 
soapy-taste that was confirmed in the cilantro prefer- 
ence population. The SNP, rs72921001 (^discovery = 
6.4 • 10~ 9 , OR=0.81, p rcpl = 0.0057) lies on chro- 
mosome 11 within a cluster of eight olfactory recep- 
tor genes: OR2AG2, OR2AG1, OR6A2, OR10A5, 
OR10A2, OR10AI OR2D2, and OR2D3. The C al- 
lele is associated with both detecting a soapy smell 
and disliking cilantro. Of the olfactory receptors en- 
coded in this region, OR6A2 appears to be the most 
promising candidate underlying the association with 
cilantro odor detection. It is one of the most stud- 
ied olfactory receptors (often as the homologous ol- 
factory receptor 17 in rat) [15, 16, 17, 18]. A wide 
range of odorants have been found to activate this 
receptor, all of them aldehydes [16]. Among the un- 
saturated aldehydes, octanal binds the best to rat 
17 [17]; however, compounds ranging from hcptanal 
to undecanal also bind to this receptor [16]. Several 
singly unsaturated n-aldehydes also show high affin- 
ity, including (E)-2-decenal [16]. These aldehydes in- 
clude several of those playing a key role in cilantro 
aroma, such as decanal and (E)-2-decenal. Thus, 
this gene is particularly interesting as a candidate 
for cilantro odor detection. The index SNP is also 
in high LD (r 2 > 0.9) with three non-synonymous 
SNPs in OR10A2, namely rs3930075, rsl0839631, 
and rs7926083 (H43R, H207R, and K258T, respec- 
tively). Thus OR10A2 may also be a reasonable can- 
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Figure 1: Manhattan plot of association with cilantro soapy-taste Negative log 10 p-values across all 
SNPs tested. SNPs shown in red are genome-wide significant (p < 5 • 10" 8 ). Regions are named with the 
postulated candidate gene. 
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Table 2: Index SNPs for regions with p < 10~ 6 for cilantro-soapy taste The index SNP is defined 
as the SNP with the smallest p-value within a region. The listed gene is our postulated candidate gene near 
the SNP. Alleles are listed as major/minor (in Europeans). MAF is the frequency of the minor allele in 
Europeans and r 2 is the estimated imputation accuracy, ^discovery an d p IC p\ are the discovery and replication 
p- values, respectively. The OR is the discovery odds ratio per copy of the minor allele (e.g., the A allele of 
rs72921001 is the allele associated with a lower risk of detecting a soapy taste). 



Population Not soapy (%) 
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Ashkenazi 634 

South Europe 458 

Europe all 13213 

North Europe 11794 

All 16196 

African- American 545 

Latino 820 

East Asia 424 

South Asia 322 



(85.9%) 104 (14.1%) 

(86.6%) 71 (13.4%) 

(87.0%) 1973 (13.0%) 

(87.2%) 1736 (12.8%) 

(87.6%) 2299 (12.4%) 

(90.8%) 55 (9.2%) 

(91.3%) 78 (8.7%) 

(91.6%) 39 (8.4%) 

(96.1%) 13 (3.9%) 



Table 3: Cilantro soapy-taste by ancestry Number of people detecting a soapy taste by ancestry group, 
sorted from most to least soapy-taste detection. For reference, we have added the minor allele frequency 
of rs7107418 in each group. This SNP is a proxy for rs72921001 (r 2 > .98), with the minor G allele of 
rs7107418 corresponding to the minor A allele of rs72921001 (which is associated with less soapy tasting). 
The p-value is the p-value of association between soapy-taste and rs7107418 in each group. 



3 




Theoretical quantiles 



Figure 2: Quantile-quantile plot of association 
with cilantro soapy-taste Observed p-values ver- 
sus theoretical p- values under the null hypothesis of 
no association. The genomic control inflation factor 
for the study was 1.007 and is indicated by the red 
line; approximate 95% confidence intervals are given 
by the blue curves. 



didate gene in this region. 

The second significant association, with 
rsll4184611 (^discovery = 3.2 • 10~ 8 , OR=0.68, 
Prcpi = 0.49), lies in an intron of the gene SNX9 
(sorting nexin-9). See Figure 3. SNX9 encodes 
a multifunctional protein involved in intracellular 
trafficking and membrane remodeling during en- 
docytosis [19]. It has no known function in taste 
or smell and did not show association with liking 
cilantro in the replication population. This SNP is 
located about 80kb upstream of SYNJ2, an inositol 
5-phosphatase thought to be involved in membrane 
trafficking and signal transduction pathways. In 
candidate gene studies, SYNJ2 SNPs were found 
to be associated with agreeableness and symptoms 
of depression in the elderly [20] and with cognitive 
abilities [21]. In mice, a Synj2 mutation causes 
recessive non-syndromic hearing loss [22]. Given 
recent evidence that the perception of flavor may be 
influenced by multiple sensory inputs (cf. [23, 24]) we 
cannot exclude the SYN J2-linked SNP as conveying 



a biologically meaningful association. While this 
SNP may be a false positive, it could also be the 
case that this SNP is associated only with detecting 
a soapy smell in cilantro (and not in liking cilantro). 

We have used two slightly different phenotypes 
in our discovery and replication, soapy-taste detec- 
tion and cilantro preference, which are correlated 
(r 2 0.33). Detection of a soapy taste is report- 
edly one of the major reasons people seem to dis- 
like cilantro. Despite having over 10,000 more peo- 
ple reporting cilantro preference, we have used soapy- 
taste detection as our primary phenotype because it 
is probably influenced by fewer environmental fac- 
tors. Indeed, we see a stronger effect of rs72921001 
on soapy-taste detection than on cilantro preference 
(OR of 0.81 versus 0.92). 

We find significant differences by sex and ancestral 
population in soapy-taste detection (Tables 1 and 3). 
Women are more likely to detect a soapy taste (and to 
dislike cilantro) (OR for soapy-taste detection 1.36, 
p = 2.5 • 10~ 10 ), Table 1. African- Americans, Lati- 
nos, East Asians, and South Asians are all signifi- 
cantly less likely to detect a soapy taste compared 
to Europeans (ORs of 0.676, 0.637, 0.615, and 0.270 
respectively, p < 0.003), see Table 3. Ashkenazi Jews 
and South Europeans did not show significant dif- 
ferences from Northern Europeans (p = 0.84,0.65 
respectively). We tested the association between 
rs72921001 and soapy-taste detection within each 
population. Aside from the European populations, 
there was only a significant association in the small 
South Asian group (p = 0.0078, OR=0.18, 95% CI 
0.053-0.64). This association is in the same direc- 
tion as the association in Europeans. Note that the 
GWAS population in Table 1 is a subset of the "Eu- 
rope all" population in Table 3, filtered to remove 
relatives (Methods). While the differences in allele 
frequency across populations do not explain the dif- 
ferences in soapy-taste detection, our analysis does 
suggest that this SNP may affect soapy-taste detec- 
tion in non-European populations as well. 

We calculated the heritability for cilantro soapy- 
taste detection using the GCTA software [25]. Wc 
found a low heritability of 0.087 (p = 0.08, 95% CI 
(-0.037 - 0.211)). This estimate is a lower bound for 
the true heritability, as our estimate only takes into 
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account heritability due to SNPs genotyped in this 
study. While this calculation does not exclude a her- 
itability of zero, the existence of the association with 
rs72921001 does give a non-zero lower bound on the 
heritability Despite the strength of the association of 
the SNP near OR6A2, it explains only about 0.5% of 
the variance in perceiving that cilantro tastes soapy. 

There are a few possible explanations for these her- 
itability numbers. It is possible that other genetic 
factors not detected here could influence cilantro pref- 
erence. For example, there could be rare variants not 
typed in this study (possibly in partial linkage dise- 
quilibrium with rs72921001) that have a larger effect 
on cilantro preference. Such rare variants could cause 
the true heritability of this phenotype to be larger 
than we have calculated. For example, the heritabil- 
ity of height is estimated to be about 0.8 however, the 
heritability tagged by common SNPs is calculated at 
about 0.45 [26]. On the other hand, there is still con- 
siderable room between the 0.5% variance explained 
by rs72921001 and the estimated heritability of 8.7%. 
Thus it is quite possible that cilantro preference could 
be polygenic, as many other complex traits are (e.g., 
[27]). Finally, it is possible that the heritability of 
cilantro preference is just rather low and that, aside 
from the association discovered here, there is not a 
strong genetic component to cilantro preference. We 
note that there can be epigenetic modifiers of taste as 
well, for example, food preferences can even be trans- 
mitted to the fetus in utero through the mother's diet 
[23]. 



158.3 
Position on chrB (Mb) 



Figure 3: Associations with cilantro soapy-taste 
near rs72921001 (A) and rsll4184611 (B) Col- 
ors depict the squared correlation (r 2 ) of each SNP 
with the most associated SNP (rs72921001, shown in 
purple). Gray indicates SNPs for which r 2 informa- 
tion was missing. 



Conclusions 

Through a GWAS, we have shown that a SNP, 
rs72921001, near a cluster of olfactory receptors is 
significantly associated with detecting a soapy taste 
to cilantro. One of the genes near this SNP encodes 
an olfactory receptor, OR6A2, that detects the alde- 
hydes that may make cilantro smell soapy and thus is 
a compelling candidate gene for the detection of the 
cilantro odors that give cilantro its divisive flavor. 
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Methods 

Subjects 

Participants were drawn from the customer base 
of 23andMc, Inc., a consumer genetics company. 
This cohort has been described in detail previously 
[14, 28]. Participants provided informed consent and 
participated in the research online, under a protocol 
approved by the external AAHRPP-accredited IRB, 
Ethical and Independent Review Services (E&I Re- 
view) . 

Phenotype data collection 

On the 23andMe website, participants contribute in- 
formation through a combination of research surveys 
(longer, more formal questionnaires) and research 
"snippets" (multiple-choice questions appearing as 
part of various 23andMe webpages). In this study, 
participants were asked two questions about cilantro 
via research snippets: 

• "Does fresh cilantro taste like soap to you?" 
(Yes/No/I'm not sure) 

• "Do you like the taste of fresh (not dried) 
cilantro?" (Yes/No/I'm not sure) 

Among all 23andMe customers, 18,495 answered the 
first question (as either yes or no), 29,704 the second, 
and 15,751 both. Participants also reported their age. 
Sex and ancestry were determined on the basis of 
their genetic data. From these answers, we chose a 
set of 14,604 participants who answered the "soapy" 
question for GWAS, and 11,851 who answered only 
the taste preference question for a replication set. 

In both the GWAS set and the replication set, all 
participants were of European ancestry. In either 
group, no two shared more than 700 cM of DNA 
identical by descent (IBD, approximately the lower 
end of sharing between a pair of first cousins). IBD 
was calculated using the methods described in [29]; 
the principal component analysis was performed as in 
[14]. To determine European and African- American 
ancestry, we used local-ancestry methods (as in [30] ) . 
Europeans had over 97% of their genome painted Eu- 
ropean, African-Americans had at least 10% African 



and at most 10% Asian ancestry. Other groups were 
built using anecstry informative markers trained on 
a subset of 23andMe customers who reported having 
four grandparents of a given ancestry. 

Genotyping 

Subjects were genotyped on one or more of three 
chips, two based on the Illumina HumanHap550+ 
BeadChip, the third based on the Illumina OmniEx- 
press-l- BeadChip. The platforms contained 586,916, 
584,942, and 1,008,948 SNPs. Totals of 291, 5,394, 
and 10,184 participants (for the GWAS population) 
were genotyped on the platforms, respectively. A to- 
tal of 1,265 individuals were genotyped on multiple 
chips. For all participants, we imputed genotypes in 
batches of 8,000-10,000 using Beagle and Minimac 
[31, 32, 33] against the August 2010 release of the 
1000 Genomes reference haplotypes [34], as described 
in [35]. 

A total of 11,914,767 SNPs were imputed. Of 
these, 7,356,559 met our thresholds of 0.001 minor 
allele frequency, average r 2 across batches of at least 
0.5, and minimum r 2 across batches of at least 0.3. 
The minimum r 2 requirement was added to filter out 
SNPs that imputed less well in the batches consisting 
of the less dense platform. Positions and alleles are 
given relative to the positive strand of build 37 of the 
human genome. 

Statistical analysis 

For the GWAS, p- values were calculated using a like- 
lihood ratio test for the genotype term in the logistic 
regression model 

Y — G + age + sex + pc\ + pc 2 + pc 3 + pc A + pc 5 , 

where Y is the vector of phenotypes (coded as 
l=thinks cilantro tastes soapy, 0=doesn't), G is 
the vector of genotypes (coded as a dosage 0-2 for 
the estimated number of minor alleles present), and 
pci,...,pcs are the projections onto the principal 
components. The same model was used for the 
replication, with the phenotype coded as l=dislikcs 
cilantro, 0=likes. We used the standard cutoff for 
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genome-wide significance of 5 • 1CP 8 to correct for 
the multiple tests in the GWAS. ORs and p- values 
for the differences in soapy-taste detection between 
sexes and population were calculated directly, with- 
out any covariates. Table 3 uses a proxy SNP for 
rs72921001, as our imputation was done only in Eu- 
ropeans, so we did not have data for rs72921001 in 
other populations. 

For the heritability calculations, we used the 
GCTA software [25]. The calculations were done on 
genotyped SNPs only within a group of 13,628 unre- 
lated Europeans. Unrelated filtering here was done 
using GCTA to remove individuals with estimated 
relatedness larger than 0.025. Thus, this group is 
slightly different from the GWAS set, as there relat- 
edness filtering there was done using IBD. We as- 
sumed a prevalence for soapy-taste detection of 0.13 
for the transformation of heritability from the 0-1 
scale to the liability scale. Otherwise, default op- 
tions were used. We calculated heritability for auto- 
somal and X chromosome SNPs separately; the es- 
timates were 0.0869 (standard error 0.0634, p-value 
0.0805) for autosomal SNPs and 2 • 10~ 6 (standard 
error 0.010753, p-value 0.5) for the X chromosome. 
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